522 research outputs found

    The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning

    Full text link
    Language models (LMs) with less than 100B parameters are known to perform poorly on chain-of-thought (CoT) reasoning in contrast to large LMs when solving unseen tasks. In this work, we aim to equip smaller LMs with the step-by-step reasoning capability by instruction tuning with CoT rationales. In order to achieve this goal, we first introduce a new instruction-tuning dataset called the CoT Collection, which augments the existing Flan Collection (including only 9 CoT tasks) with additional 1.84 million rationales across 1,060 tasks. We show that CoT fine-tuning Flan-T5 (3B & 11B) with CoT Collection enables smaller LMs to have better CoT capabilities on unseen tasks. On the BIG-Bench-Hard (BBH) benchmark, we report an average improvement of +4.34% (Flan-T5 3B) and +2.60% (Flan-T5 11B), in terms of zero-shot task accuracy. Furthermore, we show that instruction tuning with CoT Collection allows LMs to possess stronger few-shot learning capabilities on 4 domain-specific tasks, resulting in an improvement of +2.24% (Flan-T5 3B) and +2.37% (Flan-T5 11B), even outperforming ChatGPT utilizing demonstrations until the max length by a +13.98% margin. Our code, the CoT Collection data, and model checkpoints are publicly available.Comment: EMNLP 2023 (Main Conference

    Spatially Explicit Data: Stewardship and Ethical Challenges in Science

    Get PDF
    Scholarly communication is at an unprecedented turning point created in part by the increasing saliency of data stewardship and data sharing. Formal data management plans represent a new emphasis in research, enabling access to data at higher volumes and more quickly, and the potential for replication and augmentation of existing research. Data sharing has recently transformed the practice, scope, content, and applicability of research in several disciplines, in particular in relation to spatially specific data. This lends exciting potentiality, but the most effective ways in which to implement such changes, particularly for disciplines involving human subjects and other sensitive information, demand consideration. Data management plans, stewardship, and sharing, impart distinctive technical, sociological, and ethical challenges that remain to be adequately identified and remedied. Here, we consider these and propose potential solutions for their amelioration

    Antenatal risk factors for peanut allergy in children

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Prenatal factors may contribute to the development of peanut allergy. We evaluated the risk of childhood peanut allergy in association with pregnancy exposure to Rh immune globulin, folic acid and ingestion of peanut-containing foods.</p> <p>Methods</p> <p>We conducted a web-based case-control survey using the Anaphylaxis Canada Registry, a pre-existing database of persons with a history of anaphylaxis. A total of 1300 case children with reported peanut allergy were compared to 113 control children with shellfish allergy. All were evaluated for maternal exposure in pregnancy to Rh immune globulin and folic acid tablet supplements, as well as maternal avoidance of dietary peanut intake in pregnancy.</p> <p>Results</p> <p>Receipt of Rh immune globulin in pregnancy was not associated with a higher risk of peanut allergy (odds ratio [OR] 0.86, 95% confidence interval [CI] 0.51 to 1.45), nor was initiation of folic acid tablet supplements before or after conception (OR 0.53, 95% CI 0.19 to 1.48). Complete avoidance of peanut-containing products in pregnancy was associated with a non-significantly lower risk of peanut allergy (OR 0.53, 95% CI 0.27 to 1.03).</p> <p>Conclusion</p> <p>The risk of childhood peanut allergy was not modified by the following common maternal exposures in pregnancy: Rh immune globulin, folic acid or peanut-containing foods.</p> <p>Clinical implications</p> <p>Rh immune globulin, folic acid supplement use and peanut avoidance in pregnancy have yet to be proven to modulate the risk of childhood anaphylaxis to peanuts.</p> <p>Capsule Summary</p> <p>Identification of prenatal factors that contribute to peanut allergy might allow for prevention of this life-threatening condition. This article explores the role of three such factors.</p

    Estimating the Power of Indirect Comparisons: A Simulation Study

    Get PDF
    Indirect comparisons are becoming increasingly popular for evaluating medical treatments that have not been compared head-to-head in randomized clinical trials (RCTs). While indirect methods have grown in popularity and acceptance, little is known about the fragility of confidence interval estimations and hypothesis testing relying on this method.We present the findings of a simulation study that examined the fragility of indirect confidence interval estimation and hypothesis testing relying on the adjusted indirect method.Our results suggest that, for the settings considered in this study, indirect confidence interval estimation suffers from under-coverage while indirect hypothesis testing suffers from low power in the presence of moderate to large between-study heterogeneity. In addition, the risk of overestimation is large when the indirect comparison of interest relies on just one trial for one of the two direct comparisons.Indirect comparisons typically suffer from low power. The risk of imprecision is increased when comparisons are unbalanced

    The gray matter volume of the amygdala is correlated with the perception of melodic intervals: a voxel-based morphometry study

    Get PDF
    Music is not simply a series of organized pitches, rhythms, and timbres, it is capable of evoking emotions. In the present study, voxel-based morphometry (VBM) was employed to explore the neural basis that may link music to emotion. To do this, we identified the neuroanatomical correlates of the ability to extract pitch interval size in a music segment (i.e., interval perception) in a large population of healthy young adults (N = 264). Behaviorally, we found that interval perception was correlated with daily emotional experiences, indicating the intrinsic link between music and emotion. Neurally, and as expected, we found that interval perception was positively correlated with the gray matter volume (GMV) of the bilateral temporal cortex. More important, a larger GMV of the bilateral amygdala was associated with better interval perception, suggesting that the amygdala, which is the neural substrate of emotional processing, is also involved in music processing. In sum, our study provides one of first neuroanatomical evidence on the association between the amygdala and music, which contributes to our understanding of exactly how music evokes emotional responses

    Web Queries as a Source for Syndromic Surveillance

    Get PDF
    In the field of syndromic surveillance, various sources are exploited for outbreak detection, monitoring and prediction. This paper describes a study on queries submitted to a medical web site, with influenza as a case study. The hypothesis of the work was that queries on influenza and influenza-like illness would provide a basis for the estimation of the timing of the peak and the intensity of the yearly influenza outbreaks that would be as good as the existing laboratory and sentinel surveillance. We calculated the occurrence of various queries related to influenza from search logs submitted to a Swedish medical web site for two influenza seasons. These figures were subsequently used to generate two models, one to estimate the number of laboratory verified influenza cases and one to estimate the proportion of patients with influenza-like illness reported by selected General Practitioners in Sweden. We applied an approach designed for highly correlated data, partial least squares regression. In our work, we found that certain web queries on influenza follow the same pattern as that obtained by the two other surveillance systems for influenza epidemics, and that they have equal power for the estimation of the influenza burden in society. Web queries give a unique access to ill individuals who are not (yet) seeking care. This paper shows the potential of web queries as an accurate, cheap and labour extensive source for syndromic surveillance

    Lack of awareness of erectile dysfunction in many men with risk factors for erectile dysfunction

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Men with erectile dysfunction often have concurrent medical conditions. Conversely, men with these conditions may also have underlying erectile dysfunction. The prevalence of unrecognized erectile dysfunction in men with comorbidities commonly associated with erectile dysfunction was determined in men invited to participate in a double-blind, randomized, placebo-controlled trial of sildenafil citrate.</p> <p>Methods</p> <p>Men ≥30 years old presenting with ≥1 erectile dysfunction risk factor (controlled hypertension, hypercholesterolemia, smoking, metabolic syndrome, stable coronary artery disease, diabetes, depression, lower urinary tract symptoms, obesity [body mass index ≥30 kg/m<sup>2</sup>] or waist circumference ≥40 inches), and not previously diagnosed with erectile dysfunction were evaluated. The screening question, "Do you have erectile dysfunction?," with responses of "no," "yes," and "unsure," and the Erectile Function domain of the International Index of Erectile Function (IIEF-EF) were administered.</p> <p>Results</p> <p>Of 1084 men screened, 1053 answered the screening question and also had IIEF-EF scores. IIEF-EF scores indicating erectile dysfunction occurred in 71% (744/1053), of whom 54% (399/744) had moderate or severe erectile dysfunction. Of 139 answering "yes," 526 answering "unsure," and 388 answering "no," 96%, 90%, and 36%, respectively, had some degree of erectile dysfunction. The mean±SD (range) number of risk factors was 2.9 ± 1.7 (3-8) in the "yes" group, 3.2 ± 1.7 (3-9) in the "unsure" group, and 2.6 ± 1.5 (2-8) in the "no" group.</p> <p>Conclusion</p> <p>Although awareness of having erectile dysfunction was low, most men with risk factors had IIEF-EF scores indicating erectile dysfunction. Erectile dysfunction should be suspected and assessed in men with risk factors, regardless of their apparent level of awareness of erectile dysfunction.</p> <p>Trial registration</p> <p>ClinicalTrials.gov Identifier NCT00343200.</p

    Evaluation and Management of Anal Intraepithelial Neoplasia in HIV-Negative and HIV-Positive Men Who Have Sex with Men

    Get PDF
    The incidence of human papillomavirus (HPV)–associated anal cancer in men who have sex with men (MSM) is striking and has not been mitigated by the use of highly active antiretroviral therapy. Detection and treatment of high-grade anal intraepithelial neoplasia (HGAIN) may reduce the incidence of anal cancer. Anal cytology is a useful tool to detect HGAIN; annual screening of HIV-positive MSM and biennial screening of HIV-negative MSM appears to be cost-effective. MSM with abnormal cytology should be referred for high-resolution anoscopy and biopsy. Individuals with HGAIN should receive treatment; treatment modalities for HGAIN demonstrate moderate efficacy and are usually well tolerated, but greater study is required to determine which treatment is optimal. Large prospective studies are needed to document the efficacy of screening and treatment of HGAIN on anal cancer incidence. The HPV vaccine holds promise for primary prevention of anal cancer in MSM, but significant implementation challenges remain

    Telephone Triage Service Data for Detection of Influenza-Like Illness

    Get PDF
    Background: Surveillance for influenza and influenza-like illness (ILI) is important for guiding public health prevention programs to mitigate the morbidity and mortality caused by influenza, including pandemic influenza. Nontraditional sources of data for influenza and ILI surveillance are of interest to public health authorities if their validity can be established. Methods/Principal Findings: National telephone triage call data were collected through automated means for purposes of syndromic surveillance. For the 17 states with at least 500,000 inhabitants eligible to use the telephone triage services, call volume for respiratory syndrome was compared to CDC weekly number of influenza isolates and percentage of visits to sentinel providers for ILI. The degree to which the call data were correlated with either CDC viral isolates or sentinel provider percentage ILI data was highly variable among states. Conclusions: Telephone triage data in the U.S. are patchy in coverage and therefore not a reliable source of ILI surveillance data on a national scale. However, in states displaying a higher correlation between the call data and the CDC data, call data may be useful as an adjunct to state-level surveillance data, for example at times when sentinel surveillance is not in operation or in areas where sentinel provider coverage is considered insufficient. Sufficient population coverage, a specific ILI syndrome definition, and the use of a threshold of percentage of calls that are for ILI would likely improve the utility of such data for ILI surveillance purposes
    corecore